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ABSTRACT 


In  this  report,  a  mathematical  model  of  a  data 
communications  system  using  retransmission  error 
control  is  developed.  System  performance  is  spec¬ 
ified  by  information  throughput,  the  amount  of  error- 
free  data  transferred  per  unit  of  time.  Information 
throughput  is  determined  as  a  function  of  message 
length,  taking  into  account  transmission  delay,  chan¬ 
nel  error  rate,  and  overhead  words  in  the  data 
message. 
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SECTION  I 


INTRODUCTION 


In  digital  data  communications,  error  detection  and  retransmission  is 
one  method  of  obtaining  essentially  error-free  transfer  of  data  over  a  bursty 
channel  between  a  data  source  and  a  remotely  located  data  sink.  A  perform¬ 
ance  measure  of  a  data  communications  system  is  information  throughput , 
the  amount  of  error-free  data  transferred  per  unit  of  time.  In  this  report,  a 
previously  developed  mathematical  model^^  is  extended  to  permit  determina¬ 
tion  of  information  throughput  as  a  function  of  message  length,  taking  into 

account  transmission  delay,  channel  error  rate,  and  overhead  words  in  the 

[2] 

data  message.  Kuhn  ,  who  previously  examined  this  problem,  did  not 
include  in  his  analysis  the  effects  of  transmission  delay  or  overhead  words. 

The  data  communications  system,  the  mathematical  model  of  the  system 
for  which  the  throughput  is  calculated,  the  error  characteristics  of  the  channel, 
and  the  graphical  solutions  to  the  throughput  equation,  which  permits  determina¬ 
tion  of  optimum  message  length  for  maximum  throughput  under  changing 
operating  parameters  of  channel  error  rate,  delay,  and  message  overhead, 
are  described  in  this  report. 
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SECTION  n 


THE  DATA  COMMUNICATIONS  SYSTEM 


A  functional  block  diagram  of  a  typical  data  communications  system 
is  shown  in  Figure  1.  A  data  source  sends  a  sequence  of  digital  information 
via  a  noisy  channel  to  a  remotely  located  sink.  The  data  are  encoded  into 
information  blocks  called  words.  The  data  words  are  assembled  into  message 
blocks  containing  N  words.  A  data  word  is  considered  in  error  if  one  or 
more  errors  appear  in  the  received  data  word.  A  message  is  considered  in 
error  if  one  or  more  of  the  N  words  in  the  message  are  received  in  error. 

The  sink  is  assumed  to  have  the  capability  always  to  detect  errors  in 
the  received  data.  This  is  a  reasonable  assumption  since  very  powerful 
error-detecting  codes  are  available.  If  a  message  in  error  is  detected,  the 
sink  requests  the  source  to  go  back  in  the  data  sequence  to  the  message 
received  in  error  and  recommence  transmission.  The  sink  then  ignores  all 
incoming  messages  until  the  retransmitted  data  message  is  received  cor¬ 
rectly. 

The  time  interval  between  the  detection  of  a  message  in  error  and 
reception  of  the  retransmission  is  the  sum  of  the  detection  time  and  the 
round-trip  transmission  time  of  the  communications  system.  The  error- 
detection  time  is  the  time  it  takes  for  the  sink  to  receive  the  message  or 
N  word  intervals.  The  round-trip  transmission  time  of  the  communica¬ 
tions  system  is  D  word  intervals. 

Of  the  N  words  in  each  message,  I  words  are  overhead,  i.e.,  they 
carry  no  information  to  the  data  sink.  These  I  words  are  used  to  identify 
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NOISE 


Figure  1.  Block  Diagram  of  the  Data  Communications  System 


the  message,  provide  error  detection,  and  maintain  system  synchronization. 
The  remaining  N-I  words  of  the  message  contain  the  useful  information 
transferred  over  the  communications  system. 
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SECTION  m 


ERROR  CHARACTERISTICS  OF  THE  CHANNEL 


Error  measurements  on  various  communication  media,  such  as  high- 
frequency  radio  and  troposcatter,  indicate  that  these  communications 
channels  are  bursty,  i.e.,  the  error  occurrences  are  highly  correlated. 


Attempts  have  been  made  to  model  these  channels  and  to  list  their  relevant 
statistical  characterise 
channel  error  patterns. 


[3  4] 

statistical  characteristics  ’  in  order  to  describe  the  behavior  of  the 


A  quantity  commonly  used  to  describe  the  channel  is  average  word- 
error  rate,  P.  This  number  is  found  by  averaging  the  number  of  words  in 
error  in  a  fixed  interval  of  the  data  stream  over  the  total  number  of  words 
in  the  interval.  For  example,  if  10  words  were  in  error  in  an  interval  of  104 
words,  the  average  word-error  rate  would  be  10  divided  by  104  or  10-3. 

For  a  channel  with  random,  uncorrelated  error  patterns,  average  word- 
error  rate  is  a  good  measure  of  transmission  quality.  For  bursty  channels, 
however,  average  word-error  rate  is  not  a  meaningful  measure  unless 
additional  higher  order  probabilities  describing  the  random  channel  behavior 
are  given. 

In  this  report,  however,  average  word-error  rate  alone  is  used  to 
illustrate  the  method.  Higher  order  probability  terms  would  vary  the 
calculated  results,  but  not  the  method  employed. 
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SECTION  IV 


THE  MATHEMATICAL  MODEL 


GENERAL 

The  data  transmission  system  attempts  to  send  messages  over  the 
channel.  Errors  occur  in  the  messages  at  an  average  rate,  P,  and  cause 
retransmissions  which  interrupt  the  flow  of  information. 

Expressions  for  the  average  length  of  a  sequence  of  error-free  data 
messages,  the  average  number  of  retransmissions  necessary  to  receive  an 
error-free  message,  and  the  information  throughput  of  the  communications 
system  are  derived  below. 

AVERAGE  LENGTH  OF  AN  ERROR-FREE  SEQUENCE 

Let  P  equal  1-P,  where  P  is  the  average  word-error  rate.  From 
the  theory  of  combinatorial  analysis,  it  can  be  shown  that  the  probability  of  a 
sequence  of  n  words  being  transmitted  without  any  errors,  L(n),  is  given  by 


L(n)  =  P  P 


(1) 


The  average  sequence  length,  n,  is  given  by 


00 


n  L(n)  . 


(2) 


n=  0 


Substituting  (1)  into  (2)  gives 


00 


(3) 
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Equation  (3)  reduces  to 


P  P  P 


(1-P)2  P 


(4) 


Therefore,  the  average  length  of  the  errorless  sequence  is  P/P  words. 

AVERAGE  NUMBER  OF  RETRANSMISSIONS  WITH  ERRORS 

From  the  mathematics  of  combinatorial  analysis,  it  can  be  shown  that 
the  probability  of  x  retransmissions  to  receive  an  error -free  retrans¬ 
mission  K(x)  is  given  by 


x 


K(x)  =  P  P 


(5) 


The  average  number  of  retransmissions  to  receive  an  error -free  retrans¬ 
mission  is  given  by 


CO 


(6) 


x=0 


Substituting  (5)  into  (6)  yields 


00 


(?) 


x=0 


Equation  (7)  is  similar  to  Equation  (3)  and  reduces  to 


P 


(8) 


x 


P 


From  Equation  (4),  therefore, 


1 


(9) 


x 


n 
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TRANSMISSION  OF  N  WORD  MESSAGES 


When  data  are  transmitted  in  groups  of  N  words,  all  N  words 
must  be  received  without  errors  or  a  retransmission  is  requested.  The 
probability  of  successful  transmission  P(N)  is  given  by 


-  -n 
P(N)  =  P 


(10) 


The  probability  that  the  N  word  block  contains  at  least  one  error,  P(N), 
is  given  by 


— N 


(ID 


P(N)  =  1-P 


This  relationship  is  plotted  in  Figure  2,  which  shows  P(N)  as  a  function  of 
N  and  P  or  1-P. 

The  average  number  of  blocks  transmitted  without  errors  is  represented 
by  n^,  and,  in  order  to  receive  a  message  which  is  retransmitted  without 
errors,  the  average  number  of  retransmissions  of  N  word  blocks  is  rep¬ 
resented  by  .  The  transmission  with  errors  which  initiated  the  retrans¬ 
mission  sequence  is  not  included  in  x^.  Substituting  Equations  (9)  and  (10) 
into  Equations  (4)  and  (8)  yields 


n. 


N 


— N 


(12) 


1-P 


and 


(13) 
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1A  -  IS, 34  7 


Figure  2.  Probability  Error  of  N  Word  Messages  P(N)  Versus  N  and 
Word-Error  Probability  P 
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INFORMATION  THROUGHPUT 


Transmission  of  an  error-free  message  results  in  N  data  words 
being  accepted  by  the  sink.  An  error-free  sequence  of  n  messages  results 
in  N  n  data  words  being  accepted  by  the  sink.  A  retransmission  caused  by 
the  occurrence  of  an  error  and  the  termination  of  a  sequence  of  errorless 
messages  results  in  a  transmission  interruption  of  (D+  N)  words,  which  is 
the  sum  of  the  round-trip  transmission  time  and  the  error-detection  time. 
The  average  interruption  duration  is  (D  +  N)  times  the  average  number  of 
times  (1  +  x)  that  a  message  must  be  transmitted  and  retransmitted  in  order 
to  be  received  without  errors. 


Information  throughput,  T  ,  is  defined  as  the  amount  of  error-free 
information  transferred  per  unit  time.  This  quantity  is  the  ratio  of  the 
average  information  transferred  between  retransmissions  to  the  informa¬ 
tion  which  would  be  passed  if  there  were  no  retransmissions.  Thus, 


T 


N 


Nn 

Nn  +  (D+N)  (1+x) 


(14) 


Substituting  Equations  (11)  and  (12)  into  Equation  (13)  yields 

N  PN/ 1  -  PN 

tn  = 


N  (  PN/  1-PN  )  +  ^  D  +Nj[l  +  (  1-PN/ PN)  ] 


(15) 


Equation  (14)  may  be  rewritten  as 


T  = 
N 


-2N 
N  P 


Nfp2N)  +  (d+n)  (l-PN) 


(16) 
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Since  each  N  word  message  contains  I  overhead  words,  which 
results  in  a  relative  fraction,  R,  of  noninformation-carrying  words  per 
message  where 


R 


N  ’ 


(17) 


then  the  actual  information  throughput,  T,  is  given  by 


T  =  Tn  (1-R)  . 

Substituting  Equations  (15)  and  (16)  into  Equation  (17)  yields 

N  P2N  (  1  -  Vn  ) 

N  P2N  +(d  +  n)  (  1  -  PN) 


(18) 


(19) 


Equation  (18)  has  been  calculated  by  a  computer  using  the  values  of  P, 

N,  D,  and  I  shown  in  Table  I.  Figures  3  through  18  are  plots  of  Equation 
(18).  Each  plot  shows  information  throughput,  T,  versus  message  length, 
N,  for  various  values  of  P  and  constant  values  of  D  and  I. 


By  inspecting  Figures  3  through  18  it  is  obvious  that  for  given 
values  of  P,  D,  and  I  there  is  a  unique  message  length  that  maximizes 
the  information  throughput.  Message  lengths  greater  or  less  than  the 
optimum  will  result  in  decreased  throughput.  Tables  II  through  VI  show 
the  optimum  message  length  to  yield  maximum  throughput  versus  P 
for  various  values  of  D  and  I. 
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Table  I 


Values  of  P,  N,  D,  and  I 
Used  in  Computer  Solution  of  Equation  (19) 
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Figure  3.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  4.  Information  Throughput  T  Versus  Message  Length 
of  Error  P 


N  and  Probability 
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Figure  5,  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  6. 


Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  7.  Information  Throughput  T  Versus  Message  Length 
of  Error  P 


N  and  Probability 
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Figure  8.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  9.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  10.  Information  Throughput  T  Versus  Message  Length 
of  Error  P 


N  and  Probability 
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Figure  11.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  12,  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 


-21- 


00 

ID 

ro 

CO 

I 

< 


Figure  13.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  14.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 


-23- 


o 

<£> 

fO 


Figure  15.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  16.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  17.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Figure  18.  Information  Throughput  T  Versus  Message  Length  N  and  Probability 
of  Error  P 
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Table  II 


Optimum  Message  Length  in  Words  for 
F  Equal  to  1-1CT2  Versus  D  and  I 


\l 

D  \ 

1 

2 

4 

8 

10 

8 

16 

16 

32 

40 

8 

16 

16 

32 

160 

8 

16 

16 

32 

640 

8 

16 

16 

32 

Table  IH 


Optimum  Message  Length  in  Words  for 
P  Equal  to  1-10"^  Versus  D  and  I 


\  I 

D  \ 

1 

2 

4 

8 

10 

32 

32 

64 

64 

40 

32 

32 

64 

64 

160 

32 

32 

64 

64 

640 

32 

32 

64 

64 
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Table  IV 


Optimum  Message  Length  in  Words  for 
P  Equal  to  1-10-4  Versus  D  and  I 


\I 

D  \. 

1 

2 

4 

8 

10 

128 

128 

256 

256 

40 

128 

128 

256 

256 

160 

128 

128 

256 

256 

640 

128 

128 

256 

256 

Table  V 

Optimum  Message  Length  in  Words  for 
P  Equal  to  1-10“5  Versus  D  and  I 


\I 

D 

1 

2 

4 

8 

10 

256 

512 

512 

1024 

40 

256 

512 

512 

1024 

160 

256 

512 

512 

1024 

640 

256 

512 

512 

1024 
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Table  VI 


Optimum  Message  Length  in  Words  for 
P  Equal  to  1-10“  Versus  D  and  I 


\I 

d\ 

1 

2 

4 

8 

10 

1024 

1024 

2048 

2048 

40 

1024 

1024 

2048 

2048 

160 

1024 

1024 

2048 

2048 

640 

1024 

1024 

2048 

2048 
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SECTION  V 


CONCLUSIONS 

In  this  report,  a  mathematical  model  of  a  data  communications 
system  using  retransmission  error  control  was  developed.  System  per¬ 
formance  is  specified  by  information  throughput,  the  amount  of  error-free 
data  transferred  per  unit  of  time.  Information  throughput,  T,  is  determined 
as  a  function  of  message  length,  N,  taking  into  account  transmission  delay, 

D,  channel  error  rate,  P,  and  overhead  words  in  the  data  message,  I.  The 
throughput  expression  given  in  Equation  (19)  was  plotted  in  Figures  3  through 
18  for  various  values  of  N,  D,  P,  and  I. 

It  is  obvious  from  inspecting  the  curves  that  throughput  is  dependent  on 
error  rate  and  message  length.  If  the  error  rate  is  10“  or  less,  then  mes¬ 
sage  length  is  very  close  to  optimum  over  a  broad  range  of  values.  This 
optimum  is  also  greater  than  0.95.  Lower  rates  than  10-5  would  bring  the 
optimum  closer  to  1.0.  For  example,  a  nominal  error  rate  of  wire  line  might 
be  10-5.  To  attain  a  lower  error  rate  would  require  forward  acting  error- 
correction  codes  which  would  add  a  fixed  percent  of  redundancy,  C,  to  each 
data  word.  Typical  values  of  C  range  from  0.33  to  0.5.  Since  the  attainable 
increase  in  throughput  is  only  0.05  or  less,  the  use  of  these  codes,  however, 
would  actually  decrease  the  throughput. 

Typical  optimum  throughputs  at  an  error  rate  of  10-r ,  however,  range 
from  0.75  to  0.05.  If  error-correction  coding  can  decrease  the  error  rate 
to  10-5  or  better,  a  net  gain  in  information  throughput  can  result. 
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A  study  of  the  advantages  of  error-correction  coding  is  just  one 
example  of  the  usefulness  of  the  model  presented  in  this  report.  Other  uses 
will  depend  on  the  problems  of  the  communications  system  designer. 
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